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Background 

(the way things are now) 



♦ No WO Integration 

• lots of PCB area, silicon & software 

• no common architecture 

♦ Hard to change 

« no realtime transport 

performance not scalable 
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Goals 


♦ Low cost, high performance 
ergonomic peripheral bus 

♦ Read/write memory architecture 

• NOT an I/O channel 

♦ Compatible architecture with other 
IEEE busses 

• Follow IEEE 1212 CSR (Control and 
Status Register) standard 

Isochronous service 
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“Isochronous” ?? 


♦ Iso (same) chronous (time): 

• Uniform in time 

• Having equal duration 

• Recurring at regular intervals 


Data Type 

Sample size & rate 

Bit rate 

ISDN 

8 kHz x 8 bits 

64 kbps 

CD 

44.1 kHz x 16 bits x 2 channels 

1.4 Mbps 

DAT 

48 kHz x 16 bits x 2 channels 

1.5 Mbps 

Video 

variable to 30 fps 

1.5 - 216 Mbps 
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Unsupervised! 


Terminator 


Mag Disk 

Scanner 

CD ROM 

CPU 


ID=6 

ID=2 

ID=3 

H > d = 7 


fl I_ 

II 

II 


Terminator 


SCSI is typical ‘‘supervised cabling ’’ — daisy-chain; manual or fixed addresses; 
terminators at ends; devices with internal terminations must be at one end 


Stereo 

Interface 


CPU 


Mag Disk 


CD ROM 



Serial Bus is “unsupervised cabling ! — “non-cyclic network”; 
automatic address selection, no terminators, locations are arbitrary 
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Data paths (peer-to-peer) 


Stereo 

Interface 


CPU 



Live display of video image 


CD 

ROM 

1 



uig 

Cam 


Digital 

Camera 


Scanner 


Mag 

Disk 


Printer 


Direct printing of scanned image 
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Clean up the desktop cable mess! 



plus telephone/voice, 
sound input, HiFi sound, 
compressed video 



© 1996 Apple Computer, Inc 


Michael Johas Teener / 1394 Technical Summary 

(Permission to copy granted as long as this notification is retained.) 


Slide 7 


Serial 













































Protocols 

♦ IEEE 1394-1995 High Speed Serial Bus 

• “Memory-bus-like” logical architecture 

• Serial implementation of 1212 architecture 

♦ IEEE 1212-1991 CSR Architecture 

• Standardized addressing 

• Well-defined control and status registers 

• Standardized transactions 

♦ X3T10 Serial Bus Protocol and IEEE P1285 

• Integration of DMA into I/O process 

• SBP is based on SCSI-3 CDB structures 

■■ 

IEEE P1285 treats disk as software-managed slow cache 
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IEEE 1212 addressing 


bus # 0 


bus # 1 


bus #1022 


bus #1023 
(local bus) 


node # 0 


node # 1 


node # 62 


node # 63 
(broadcast) 



initial 

memory 

space 


private 


register 



IEEE 1212 


Serial Bus 


ROM 
(1st IK) 


initial 

units 

space 


2048 


256M 


LL 

LL 

CO 

X 

o 

0x3F 

OxFFFFF 

0x0000200 


= all cycle timer registers on local bus 


The serial hus uses “64j*bit, fixed” addressing 
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Serial Bus Management 


IEEE 1394 protocol Stack 




[•1VMIUK*] 


Isochronous 

Channels 


Transaction Layer 


Link Layer 



Cycle Control Packet Transmitter Packet Receiver 


Physical Layer 


Encode/Decode ■ Arbitration ■ Media Interface 


Firmware 
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Cable interface 


Cable Media 


Cable 
Interface 
(port n) 


arbitration and resynch 


Link Interface phy b 


Serial Bus 

Higher Layers Node B 


Cable Cable 

Interface Interface 
(port 1) (port 2) 


arbitration and resynch 


Link Interface 


Serial Bus Higher Layers 
(Link & Transaction) 


Cable 
Interface 
(port n) 


PHY A 


Node A 


PHY transforms point-to-point cable links into a 
logical bus 

Cables and transceivers are bus repeaters 
Baseline limit of 4.5 m between nodes 

farther possible with thicker cables or <400 Mbit/sec 
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Cable media 




♦ 3-pair shielded cable 

• Two pairs for data transport 

• Ope pair for peripheral power 

♦ Small and rugged connector 

• Two sockets in the same area 
as one mini-DIN socket 


♦ CMOS transceiver 



220 mv differential 
4 ma drive 

< 500 pt worst case jitter 
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Cable media example 


▲ 

6mm 

(typ) 

▼ 

♦ Capable of operation at 400 Mbit/sec for 4.5 m 

♦ Slightly thicker wire allows 10 meter operation 



Power pair: 

22 AWG /0.87 Dia twisted pair 

60% braided shield over foil shield 
(over signal pairs - 2X) 

Signal pairs: (2X) 

28 AWG/0.87 Dia twisted pairs 

97% braided overall shield 
0.70 thk PVC jacket 
Fillers for roundness (if req'd) 
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Cable interface features 


♦ Live attach/detach 

• System protected from power on/off 
cycling 

• Higher layers provide simple 
management 
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Peripheral power 


♦ 8-40 VDC carried by cable 

♦ Total available power is system 
dependent 

• Node power requirements must be 
declared in configuration RO iyL „ 

♦ Cable system allows up to 1.5 A (60 
watts) per link 

• Nodes can pither source or sink power 



Multiple power sources on one bus 
provide additional flexibility 
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Physical layer 


98.304 Mbit/sec half duplex 
transport 

• Data reclocked at each node 

• 196.608,393.216,... Mbit/sec growth paths 

Data encoding 

• Data and strobe on separate pairs 

• Automatic speed detection 

Fair and priority access 

• Tree-based handshake arbitration 
Automatic assignment of addresses 
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Example cable PHY IC 


11 


11 


11 


Local 

Clock 


Resynch 


Link 

Interface 


Arbitration 
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Two twisted pairs 
for data: TP A and 
TPB 


RxData 

RxClock 

TxData 

TxClock 

aRxArbState 

■TxArbState 


TPA is transmit strobe, 
receive data 

TPB is receive strobe, 
transmit data 

Both are bidirectional 
signals, both are used in 
arbitration 


Reclocks repeated 
packet data figpals 
using local clock 
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Cable arbitration phases 

♦ Reset 

• Used whenever reconfiguration needed 

• Live insertion & new cycle master are examples 

♦ Tree Identification 

• Transforms a simple net topology into a tree 

♦ Self Identification 

• Assigns physical node number (Node ID) 

• Exchange speed capabilities with neighbors 

♦ Normal Arbitration 

• Root has highest priority 
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Tree identification #2 



♦ After Tree ffl|process, the Root nolle is 
determined and each port is labeled as 
pointing to a child or a parent 
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Self identification 



♦ After the self ID process, each node has a 

unique physical node number, and the topology 
has been broadcast 
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Normal arbitration #1 




Suppose nodes #0 and #2 start to 
arbitrate at the same time, jihey both 
send a request to their parent... 



© 1996 Apple Computer, Inc 


Michael Johas Teener / 1394 Technical Summary 

(Permission to copy granted as long as this notification is retained.) 


Slide 22 




























Normal arbitration #2 



♦ The parents forward the request to their 
parent and deny access to their other 
children... 
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Normal arbitration #4 


grant 







♦ The winning node #0 changes its request to a 
data transfer prefix, while the loosing nolle #2 
withdraws its request... 


© 1996 Apple Computer, Inc 


Michael Johas Teener / 1394 Technical Summary 

(Permission to copy granted as long as this notification is retained.) 


Slide 25 














Normal arbitration #5 



The parent of node 1 sees the data prefix and withdraws 
the grant, and now all nodes are correctly oriented to 
repeat the packet data (a "deny" is a "data prefix!)... 
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Link layer 


♦ Implements acknowledged 
datagram service 

• Called a "subaction" of arbitration, packet 
transmission, and acknowledge 

♦ Flexible addressing using 1212 
architecture 

• Direct 64-bit addressing (48 bits per node) 

• Hierarchical addressing for up to 63 nodes 
on 1023 busses 
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Isochronous transport 


♦ ffiptional 

• But required for multimedia applications 

♦ Multiple "channels" each 125 jisec 
"cycle" period 

• Channel count limited by available bandwidth 

♦ Variable channel size up to *1000 
bytes/cycle 

• Up to -2000 bytes/cycle at 196 Mbit/sec 
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Link Bayer operation 


Requester 
Link layer 


Responder 
Link layer 


Link request 


Link 
confirmation 


arbitration & 
packet 
transmission 


acknowledge 
(not present for 
broadcast or 
isochronous) 


Link indication 


Link response 



Example packets 



16 

16 

16 

48 

32 

32 

arb 

dest 

ID 

codes 

source 

ID 

destination 

offset 

32-bit 

data 

CRC 


gap 


> 1 |isec 

< 1 nsec 


acknowledge g 
gap 


ack 


subaction 

gap 


4-byte packet 


0.2 nsec 


160 bits @ 98.304 Mbit/sec ~1.6 jisec 


< 0.75 nsec 


0.5 nsec 


> 1 nsec 




subaction 

gap 


160 




2048 


32 


data 


\r 


subaction 
gap 


256-byte packet 


2240 bits @ 98.304 Mbit/sec ~ 22.8 |isec 


♦ Actual efficiency very good 

• 10 Mbyte/sec information throughput including all of the 
“SBP” SCSI-3 protocol using 100 Mbit/sec rate (-80%) 
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Fairness interval 



fairness 
interval N-1 


owner A 


fairness interval N 
owner B 


owner M 


Jsl_ 

arb data ||| j arb 

data | • • • f] 

/' 

subaction ►] 

^ f / 

/ 

arbitration 

subaction gaps 

reset gap 



arb 


data 


fa fairness 
interval N+1 


arb 


\ 


arbitration reset gap 


♦ Jfairness Interval is bounded by “arbitration 
reset gaps” 

♦ Reset gaps are longer than normal subaction 
gaps 


© 1996 Apple Computer, Inc 


Michael Johas Teener / 1394 Technical Summary 

(Permission to copy granted as long as this notification is retained.) 


Slide 31 




Fair arbitration 



fairness 
interval N-1 


fairness interval N 


arb | node A | | arb | node B~| | arb | node C 





fairness 
interval N+1 


=■ 



i”* ®\i W / - 

set at arbitration reset gap cleared when node wins arbitration 


♦ Bach node gets one access opportunity 
each Fairness Interval 

special case for isochronous data 
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Cycle structure 


ioschronous asynchronous 



nominal cycle period = 125 |isec ±100 ppm 
cycle synch cycle synch 


♦ Scott Sjnyers of Sony to elaborate 
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Transaction layer 


Requestor 

Transaction 

Layer 


Responder 

Transaction 

Layer 


Transaction 

Request 


Transaction 

Confirmation 


fran 


Action nn 

includes data if 
"write" or "lock" 

action stiti° s 

van Sotvn^ on 

includes data if 
"read" or "lock" 


Transaction 
Indication 


Transaction 

Response 
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Multiple transaction types 


♦ Simplified 4-byte (quadlet) read 
and write are required 

♦ Variable-length block read and 
write are optional 

♦ Lock transactions optional 

• Swap, Compare-and-swap needed for 
bus management 



© 1996 Apple Computer, Inc 


Michael Johas Teener / 1394 Technical Summary 

(Permission to copy granted as long as this notification is retained.) 


Slide 35 




Efficient media usage 


Split transactions required 

• Transactions have request and response 
parts 

• Bus is never busy unless data is actually 
being transferred 

Request and response can be unified 

two ways 

• "Read" and "Lock" can have 
concatenated subactions 




Write" can itave immediatecompletion 
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Split transaction 


Requester Responder 


Request and 
response have 
separate 
subactions 


Other Link-Layer operations can 
take place between these two 
subactions, including sending other 
transaction requests or responses 


Read 
Request' 


Transaction Link 
Layer Layer 


Transaction 

Layer 



Read 
Indication 


Read 

Response 
(complete, 
with data) 


Read 
Confirmation 
(complete, with 
data) 



Concatenated transaction 


^HJscd if 

responder is fast Read 
enough to return Request 

data before ack 
is completed 

the responder does not release the bus 
after sending the ack, sends response 
packet within 1.5jisec 


Read 
Confirmation 
(complete, with 
data) 


Requester Responder 


Transaction Link Link 
Layer Layer Layer 

Link 

nequesx | Request 


Transaction 

Layer 



Conf 

(pend) 


Link 
Indication 


Packet 


t 


Ack 

(pend) 

Resp 

Packet 



Resp 

(compl) 


Ack 

(compl) 


Link 

Indication 



Resp 

(pend) 

Link 

Request 


Conf 

(compl) 


Read 
Indication 

Read 
Response 
(complete, 
with data) 



Unified transaction 


Requester Responder 

Transaction Link Link Transaction 


Layer Layer Layer Layer 


Write Request 


Link 



Request 

Packet 


Link 

Indication 


Write Confirmation 



(complete) 


Write Indication 

Write Response 
(complete) 


♦ Only used for write transactions 





Bus management 

♦ Automatic address assignment 

• Done in physical layer 

♦ Standardized addresses and configuration 
ROM from IEEE 1212 architecture 

♦ Resource management 

• Isochronous channels and bandwidth 

• Power 

♦ Peter Johanssen of Congruent to elaborate 
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Futures 


♦ Gigabit rates/longer distances 

• 1-8 Gbit/sec, 25+ meter hops 

♦ Incremental addition of nodes 
without bus reset 

♦ Advanced power management 

♦ Bridging issues 

• for > 63 derices, or for isolation of 
high-bandwidth local traffic 
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How does 1394 help? 

♦ Much better human interface 

• smaller, more rugged connectors with 
defined usage 

• Hot plugging, no manual configuration 

♦ Excellent real performance 

• High true data rates 

• Direct map to processor I/O model 

• DMA is simple: CPU memory directly 
available to peripherals 

• examples: SBP and IEEE 1285 support direct 
scatter/gather buffers 
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... but even more important 

♦ It's inexpensive 

• For computers, it’s almost as cheap as 
single-ended 8-bit sCSiUM 

• will be cheaper since it's silicon-intensive 

• Much less expensive for peripherals 
and consumer electronics 

♦ Direct support for isochronous data 

• Likely choice for digital consumer 
video, high-end audio 

• Media servers get cheaper 
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Getting documentation 

♦ “IEEE 1394-1995 High Performance Serial Bus” 

• IEEE Standards Office 1-1-908-981-1393 


♦ Internet 

• ftp://ftp.apple.com/pub/standards/pl394 

• Drafts in FrameMaker 5 or Acrobat "pdf" format 

• Papers, email digests, performance reports 

♦ Internet email reflector 

• “P1394@Sun.COM" 

• Administrator is “Bob.Snively@Eng.Sun.COM” 

♦ 1394 Trade Association 


• httpi//www.flrewire.org 
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